Research on the Deep Deterministic Policy Algorithm Based on the First-Order Inverted Pendulum

Abstract

With the mature development of artificial intelligence technology, the application of intelligent control algorithms in control systems has become a trend to meet the high-performance requirements of modern society. This paper proposes a deep deterministic policy gradient (DDPG) controller design method based on reinforcement learning to improve system performance. Firstly, the optimal DDPG algorithm is derived from the Markov decision process and the Actor–Critic algorithm. Secondly, in order to avoid the local optima of traditional systems, the capacity and settlement of the experience pool are adjusted to absorb positive experience, accelerate convergence, and complete efficient training. To solve the overestimation of the Q value in DDPG, the overall structure of the Critic network is changed to shorten the convergence period at low learning rates. Finally, a first-order inverted pendulum system was constructed in a simulation environment to verify the effectiveness of PID, DDPG, and improved DDPG. The results reveal that the improved DDPG controller has a faster response to disturbances and a smaller displacement and angular displacement of the pendulum. This further proves that the improved DDPG algorithm has better stability and stronger anti-interference ability and recovery. The work provides a certain reference for the application of reinforcement learning in control systems.
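The abstract refers to an experience pool whose capacity is tuned. As a rough illustration only, a generic fixed-capacity experience replay buffer of the kind commonly paired with DDPG might look like the sketch below. The class name `ReplayBuffer`, the transition layout, and the uniform sampling are assumptions made for this example. It does not reproduce the paper's capacity or settlement adjustments, which the abstract does not specify.

```python
import random
from collections import deque
from typing import Deque, List, Tuple

# (state, action, reward, next_state, done); this field layout is an assumption.
Transition = Tuple[List[float], float, float, List[float], bool]


class ReplayBuffer:
    """Fixed-capacity experience pool with uniform sampling (hypothetical helper)."""

    def __init__(self, capacity: int, seed: int = 0) -> None:
        # deque(maxlen=...) discards the oldest transition once the pool is full.
        self._buffer: Deque[Transition] = deque(maxlen=capacity)
        self._rng = random.Random(seed)

    def push(self, transition: Transition) -> None:
        self._buffer.append(transition)

    def sample(self, batch_size: int) -> List[Transition]:
        # Sample without replacement; never request more than is stored.
        k = min(batch_size, len(self._buffer))
        return self._rng.sample(list(self._buffer), k)

    def __len__(self) -> int:
        return len(self._buffer)
```

In such a design the capacity trades off sample diversity against how quickly stale transitions are forgotten, which is one plausible reason to tune it.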

Journal

Journal title: Applied Sciences

Year: 2023

ISSN: 2076-3417

DOI: https://doi.org/10.3390/app13137594